Locking in Returns: Speeding Up Q-Learning by Scaling

نویسندگان

  • Soumi Ray
  • Tim Oates
چکیده

One problem common to many reinforcement learning algorithms is their need for large amounts of training, resulting in a variety of methods for speeding up these algorithms. We propose a novel method that is remarkable both for its simplicity and its utility in speeding up Q-learning. It operates by scaling the values in the Q-table after limited, typically small, amounts of learning. Empirical results in a variety of domains, including a partially observable multi-agent domain that is exceptionally difficult to solve using standard reinforcement learning algorithms, show significant speedups in learning when using scaling.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

From Transfer to Scaling: Lessons Learned in Understanding Novel Reinforcement Learning Algorithms

A major drawback of reinforcement learning (RL) is the slow learning rate. We are interested in speeding up RL. We first approached this problem with transfer learning where we have two domains. We developed a method to transfer knowledge from a completely trained RL domain to a partially trained related domain (where we want to speed up learning) and this helped increase the learning rate suff...

متن کامل

Time manipulation technique for speeding up reinforcement learning in simulations

A technique for speeding up reinforcement learning algorithms by using time manipulation is proposed. It is applicable to failure-avoidance control problems running in a computer simulation. Turning the time of the simulation backwards on failure events is shown to speed up the learning by 260% and improve the state space exploration by 12% on the cart-pole balancing task, compared to the conve...

متن کامل

Scaling production and improving efficiency in DEA: an interactive approach

DEA models help a DMU to detect its (in-)efficiency and to improve activities, if necessary. Efficiency is only one economic aim for a decision-maker; however, up- or downsizing might be a second one. Improving efficiency is the main topic in DEA; the long-term strategy towards the right production size should attract our attention as well. Not always the management of a DMU primarily focuses o...

متن کامل

IEESE International Journal of Science and Technology (IJSTE), Vol. 1 No. 4, December 2012

Abstract. Speeding up a synthesizer’s locking process can be considered as speeding up the charge pump PLL. Several methods have been introduced to increase the speed of the locking process. One way to achieve fast locking is to use a bangbang frequency comparator (BBFC) in the feed-trough path to achieve a faster locking process. In this paper, we present a differential equation for this fast ...

متن کامل

Speeding Up and Boosting Diverse Density Learning

In multi-instance learning, each example is described by a bag of instances instead of a single feature vector. In this paper, we revisit the idea of performing multi-instance classification based on a point-and-scaling concept by searching for the point in instance space with the highest diverse density. This is a computationally expensive process, and we describe several heuristics designed t...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011